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THE  PROBLEM 

Dstermining  the  relationship  between  two  categorical  or  qualitative  variables 
has  previously  been  possible  only  through  use  of  C,  the  contingency  coefficient. 

C  has  several  distinct  disadvantages^  csnong  them  its  lock  of  comporc^ility  to  the 
Pearson  product-moment  correlation  measures  of  relationship.  This  lack  of  compcra- 
bility  frequently  creates  problems  In  the  interpretation  of  contingency  coefficients 
and  the  generalization  of  results. 


FINDINGS 

A  technique  is  described  which  provides  an  extension  of  the  Pearson  correlation 
equation  to  the  case  of  two  categorical  varid^les.  Through  canonical  weighting, 
maximal  scale  values  are  assigned  to  categories  of  the  qualitative  variables.  These 
scale  values  create  an  optimally  weighted  composite  for  each  categorical  varidsle, 
and  the  correlation  between  composites  is  the  "K"  coefficient,  which  avoids  many 
of  the  disodvontoges  of  C,  ond  allows  the  computation  of  a  product<-morrM»nt  relation¬ 
ship  between  two  categorical  variables. 


INTRODUCTION 

Many  problems  In  psychology  must  of  necessity  deol  with  cotegorlcol  data.  In 
the  post,  when  an  investigator  has  hod  two  categorical  vorlobles/  and  wished  to  express 
the  relatlorahlp  between  them,  his  usual  recourse  was  to  compute  C,  the  contingency 
coefficient.  |t  is  well  documented  thot  C  hos  certain  dlsodwmtoges.  For  excmipte#  the 
ma^iltude  of  C  cannot  be  lnt«‘preted  os  Indlcatlrtg  the  some  decree  of  relotlonidtip  os  an 
ordinary  hxx’son  correlation  coefficient.  This  is  (^e«  In  part,  to  built-in  upper  limits 
of  C.  When  ^  number  of  columns  (h  )  equals  the  number  of  rows  (g),  then  the  upper 
limit  of  C  =  The  problem  of  Interpreting  C  Is  complicated  even  h^rther  when 

g  ^  h;  for  such  coses  the  upper  limit  of  C  is  unknown.  Even  for  a  2  x  2  contingency 
table  the  magnitude  of  C  does  not  equal  the  magnitude  of  p ,  which  Is  a  Pearson  equa¬ 
tion  and  which  can  readily  be  computed. 

PROCEDURE 


THE  CORRELATION  RATIO 


In  attempting  to  find  a  Peorson-type  substitute  for  the  contingency  coefficient, 
certoin  guide  lines  seemed  evident.  First,  as  mentioned  above,  the  technique  should, 
for  a  2  X  2  table,degenerate  into  the  four-fold  contingency  coefficient  (^).  SecorKiiy, 
the  technique  for  o  2  x  g  table  should  yield  the  some  result  as  if  we  had  asslgt>od  o 
value  of  **1 "  to  row  1 ,  a  "0*  to  row  2,  arxi  solved  the  problem  by  means  of  the  correla¬ 
tion  ratio  equation. 


The  raw  score  form  of  the  Pearson  product-moment  equation  is 


•^XY  ” 
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Wherry,  Sr.  (1)  pointed  out  in  1944  that  if  one  hod  categorical  data  for  variable  X  and 
interval  data  for  variable  Y,  and  If  orw  anigned  the  mean  Y-scors  of  those  individuals 
in  category  !  as  their  X‘~score  (these  are  known  as  the  Wherry  weights)^  i.e. »  X,  ^  = 

Z  Y/n,  /  then  the  Pearson  equation  could  be  shown  to  be  equivalent  to  the  correlation 
ratio  equation.  Usir>g  the  Wherry  weights^certain  values  in  Eq.  0)  con  be  rewritten. 


Thus, 
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[hlote  that  Eqs.  (2)  and  (4)  ore  equol.J 

SubstiMing  Eqs.  (2),  (3)/  ond  (4)  Into  Eq.  (1)/  ^  obtain 
9  n  N 

,  =  Ni(iY)Vn> -gy  _ _ 

f  y  V  /  M  N 

VNZ  (rY)Vn,  -(zy)*  Vnzy»-(zy)> 

But  the  numerator  Is  the  square  of  the  left  term  In  the  denominator;  therefore. 
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Equation  (6)  1$,  of  course,  the  correlation  ratio  equation  for  use  when  Y  is  a 
continuous  variable  and  X  is  a  categorical  variable  with  g  categories.  The  fact  that  the 
con’elalion  ratio  is  derivable  from  the  Pearson  equation  is  unfortunately  neglected  in 
most  stotistiool  texts. 

THE  -K-  COEFFICIENT 

Wheiry,  Sr.,  in  the  article  mentioned  above,  was  primarily  interested  in 
development  of  a  technique  for  using  qualitative  data  in  multiple  regression  techniques. 
An  alternative  method  for  using  qualitative  data  in  multiple  regression  studies  has 
recently  gained  prominence  in  the  literature.  This  technique  is  to  create  a  dichotomous 
variable  for  each  category  of  X.  Such  variables  have  been  referred  to  as  "categorical 
predictor  variables*  by  Bottenberg  and  Ward  (2)  and  as  ^pseudo  dichotomous  variables* 
by  Wherry,  Jr,  (3).  Thus,  If  an  individual  is  in  the  first  category  of  X,  he  receives  a 
*1  *  on  the  first  created  dichotomous  variable  and  a  *0”  on  all  other  dichotomous 
variables  representing  variable  X.  If  he  is  in  the  second  category  of  X,  he  receives  a 
*1  *  on  the  second  created  dichotomous  variable  otkI  a  *0*  on  all  the  other  created 
dichotomous  variables,  et  cetera. 

If  g  dichotomous  variables  ore  created  os  stated  above  and  these  variables  are 
used  to  predict  variable  Y,  a  continuous  varloble,  the  multiple  correlation  (Ry'X  Xg  ***) 

will  also  equal  r)  and  the  raw  score  beta  weights  will  equal  the  Wherry  weights  men¬ 
tioned  above.  It  is  also  of  interest  to  point  out  that  the  shrunken  multiple  correlotion 
R  will  equal  Kelly's  epsilon  (c),  the  unbiased  estimate  of  t;. 
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qp— Kjijiiiifniieim 


TKuS/  we  may  recognize  tfiol'  there  exists  a  method  of  assigning  numbers  not 
covered  in  the  usual  discussions  of  intervol,  ordlnoi,  and  nominal  scaling  technique. 
This  new  technique  we  will  refer  to  os  "maximal  **  scaling,  becouse  it  is  Hie  assignment 
of  the  set  of  numbers  to  a  set  of  mutuoily  exclusive  cotegortes  which  will  moximize  the 
Pearson  equation. 

if  one  hod  two  cotegoricoi  variables,  X  (with  g  categories)  and  Y  (with  h  cate~ 
gories),  one  could,  as  stated  above,  create  a  set  of  g  dichotomous  vurinbles  to  represent 
X  and  a  set  of  h  dichotomous  variables  to  represent  Y.  If  one  could  then  simultaneously 
solve  for  the  optimal  ("Maximal")  weights  f^  the  h  voriobles  and  the  optimal  wefgMs 
the  g  variables,  m  os  to  maximize  the  Pearson  equation,  one  could  have  a  sto^emertf  of 
Hm  mognihx^  of  relationship  between  two  cotegoricoi  variobtes  which  is  o  Pearson-type 
of  correlation  coefficient.  Fortunately,  once  one  has  computed  the  interreiaKonshIps 
among  the  two  se*s  of  dichotomous  variables.  Hotelling's  (4)  technique  of  oononieol  cor* 
relation  con  be  used  to  solve  for  the  maximal  we!(^ts.  The  resulting  Peorson  coefficiertf 
will  be  referred  to  as  the  "K*coefficient. "  The  use  of  the  cononical  correlation  tech** 
nique  for  scoring  categorical  data  was  first  mentioned  by  Fisher  (5)  in  1938,  and  hos  also 
been  reported  elsewhere  (6).  The  discussions  of  the  technique  are  usually  couched  In 
terms  of  matrix  algebra.  Because  both  matrix  algebra  and  canonical  analysis  are  not 
welt  known,  relatively  few  persons  would  be  able  to  compute  o  K^coefflcient  even  If 
they  had  the  above  references  available.  In  fact,  even  if  one  understands  the  cononicol 
technique  (Several  modifications  ewe  necessary  for  hcmdling  two  sets  of  categorical  vorl* 
ables.  For  the  above  reasons  it  seems  desirable  to  set  forth  the  necesKiry  steps  in  obtain* 
Ing  a  K'coefficient. 

AN  EXAMPLE 

Let  us  assume  we  desire  to  find  the  K*coefffcient  for  the  7x8  contingency  table 
shown  In  Table  I.  The  contingency  table  should  be  arranged  so  that  the  numbers  rows 
(g)  Is  equal  to  or  less  than  the  number  of  columns  (h). 

Step  1 .  - 

Compute  n,  N*n,  and  V n  (N*n)  for  each  row  and  column  of  the  contingency 
table  as  shown  in  Table  1. 


Step  2. 

Compute  ag  by  g  +  h  Intercorrelation  matrix  ming  the  equation 


r  =  N  ■  c,j*n,.  n, 

(N-nj)  V  n,  (N-n,) 


(9) 


where  c^j  =  the  cell  entry  in  Table  1  if  i  is  on  X*variable  and  j  is  a  Y*'variable, 
otherwise  let  C|  ^  *  0, 


Accurocy  should  be  maintained  to  the  sixth  and  seventh  decimal  in  computing 
the  matrix,  it  is  not  necessary  to  compute  the  Intercorrelations  among  the  Y*variables. 
The  results  of  step  2  are  shown  in  Table  II. 
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Th«  IntarcofralotlocK  Among  tho  g  X'Vorioblot  and  Tho  Intarcorrolotiont  of  Itw  g  X*Vs'IablM  ond  Hw  h  Y^Vorlobl 


^15 


St^  3. 

Perform  a  diagonal  (actor  cmolysls  (7)  extracting  not  more  than  g**!  factors.  It 
turns  out  that  a  maximum  of  g**l  factors  ore  needed  to  explain  all  the  variance  of  the 
g  X^vorlobles  since  each  of  the  X'Vorlabies  Is  mutuoily  exclusive  and  mutuotly  exhous* 
tive.  That  Is,  If  a  person  Is  found  In  one  row,  he  ccmnot  be  found  In  any  other  row, 
ond  If  a  person  Is  Included  In  the  analysis,  he  must  be  found  In  one  of  the  rows.  If,  by 
chance,  tfte  proportions  of  cases  fourd  In  each  column  are  Identical  for  two  rows,  the 
rows  i^Kwid  be  combined  since,  from  o  predictive  stisidpolnt,  there  Is  no  difference 
between  the  two  categories. 

The  results  of  the  diagonal  fector  aiKilysts  are  shown  In  Tcd>le  III. 


Step  4. 

Usirtg  the  h  loadings  of  the  Y*variobles  on  the  g"!  diogonal  foctors,  obtoln 
the  Interrelationships  of  the  Y*vorI(^les  In  the  X^spoce.  To  obtain  this  matrix,  use  the 
eryuatlon 

fg-1 


r..  - 


r  (Of 
1 


.Y. 


V.  > 


(10) 


where  ^ 

rj  j  Is  the  relationship  of  the  i  Y“varlable  to  the 

X'Space,  and 


Y"varl6ble  In  the 


Of  y  Is  the  diagonal  factor  loading  of  the  Y**varlable  on  the 
diogonal  factor. 


The  results  of  step  4  ore  shown  In  Table  IV. 

Table  IV 


The  Interrelationships  of  The  Y^Variobles  in  the  X~Space,  O'n  '’) 


Yl 

Y2 

Y3 

Y4 

Y5 

Y6 

Y7 

Y8 

Yi 

.04475 

-.04189 

.06056 

-.03410 

-.03684 

-.01188 

-.03146 

-.04721 

Yo 

.11155 

-.00051 

.10540 

-.04008 

-,00779 

.02698 

.01826 

Y3 

.09650 

-.07787 

.05546 

-.01101 

.05055 

-.05709 

.11701 

-.03471 

-.00843 

.03500 

.00442 

yI 

.03527 

-.00646 

.02961 

-.04246 

.01398 

-.00731 

.01512 

V 

.04095 

.03380 

.06986 
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step  5. 

Perform  a  Mnclpo!  Axis  factor  cmol/sis  (8)  extroctfng  only  the  first  Principal 
Axis.  This  step  obtains  the  vector  through  the  X"spoce  which  will  exploin  a  maximum 
amount  of  the  Y  vorionce  explotnobie  with  only  one  vector.  The  Principal  Axis  lead" 
ings  (opY  '<)  ihown  in  Table  V. 

Table  V 


The  Principal  Axis  Loadings  of  the  Y"Vciriabies,  (opy^  **) 


Yl 

Y2 

Y3 

Y4 

YS 

Y6 

Y7 

Ys 

P.A.  -.18127 

.29501 

-.30391 

.28791 

-.15924 

.01475 

.15021 

.15049 

Step  6. 

Obtain  the  relationships  (a  f  p's)  of  the  Arincipoi  Axis  to  each  of  the  h‘1  diag¬ 
onal  factors  by  solving  g-l  set  of  e<^ations  of  the  type 

%  Y^  *  P  ^  *  °f,  P  *  -  “fg-l  Y j  *  %-l  P  “  ® PYj  ' 


"f,  Y,  •  p  *  V.  *  \  I*  *  "•  *  V>  Y,  ■  %-l  I*  ‘ 


+  .  .  . 


*4*  '4' 

*  *  e  •  • 


■f.Yj.,  -“f/^-f.Yg-i  •'■f.P*™  ^  Vi  VI  ■  “nTj-r 


Notice  thot  only  g-1  of  the  available  h  Y**vcirlabies  ore  used  in  the  above 
simultaneous  solution.  A  consistent  solution  will  be  obtained  with  ony  g-1  of  the  Y~ 
variables  used.  For  simplicity  the  first  g-1  set  was  chosen.  Toble  VI  shows  the  initial 
volues  for  the  simultaneous  equations  os  obtained  from  Tables  111  and  V,  and  the  rela¬ 
tionship  of  the  fVincipal  Axis  to  the  diagonal  factors.  As  a  computational  check,  the 
sum  of  the  squares  of  the  a^ .  p 's  should  equal  1 .0  (within  rounding  error).  This 

demonstrates  that  the  Principal  Axis  Vector  (obtained  In  Step  5)  is  wholly  within  the 
X-spoce,  and  must  therefore  be  completely  predictable  from  the  g-1  X-Vorlobles. 
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Table  VI 


The  Beginning  Values  for  fheg~1  Simultaneous  Equotions  be  Solved  to  (%tain 
the  Relationship  of  the  Mncipal  Axis  to  the  Diagonal  Factors. 

Final  Solution  is  Shown  In  the  Row  Labeled  a^^  p 


Coefficients  for 


1l 

oy> 

V 

Of  p 

s 

‘*PYi 

-.07235 

-.02135 

-.11552 

.15839 

.00126 

.02492 

=  -.18127 

-.11255 

.26920 

.11692 

-.07960 

-.03003 

.07423 

=  .29501 

.14078 

-.06813 

-.14393 

.18450 

.05983 

-.10287 

=  -.30391 

-.08916 

.23753 

.11757 

-.02682 

-.05377 

.18762 

=  .28791 

.11519 

-.02468 

-.08230 

.11311 

.03746 

-.02057 

=  -.16924 

.03773 

-.04738 

-.00275 

-.09817 

-.02135 

.01455 

=  .01475 

af.p-.46213 

.44628 

.47222 

-.47346 

-.14697 

.34421 

m=fj 


Step  7. 

Using  the  loodings  of  the  A’incipol  Axis  on  the  diagonal  factors  as  the  criterion 
variable,  solve  the  g"l  simultaneous  equations  necenory  to  obtain  the  standard ~score 
beta  weights  of  the  g**!  X'variables  used  as  the  basis  for  the  diagonal  factors.  The  g**1 
set  of  simultaneous  equations  for  step  7  is  of  the  form 

V.  •  ^  "  V.  •  "-S  *  -  *  Vg-i  •  Vi  ■  %•> ' 

^-^Vg-r  Vi'  V' 

•ee^eee  eee 

Of  .  Pvm  ~  .  P  •  0? 

tg-lXg-]  tg-]P 

The  coefficients  for  the  X's  for  use  in  the  above  equations  are  the  diagonal 

factor  loadings  of  the  first  g-1  X'variabies  as  shown  In  Table  III.  The  a^  p's  ore  the 

• 

solution  to  the  simultaneous  equations  solved  in  Table  VI.  The  results  of  step  7  are 
shown  in  Table  VII. 


k 


A 


Table  VII 


The  Standard  Score  Beta  Coefficients  's)  for  the  g"l  X"Varlable$  Which  Perfectly 

‘ 

IVedict  the  Vector  Tlvou£^  the  X  Space  Capable  of  Exptoining  the  Most  Varicetce 


Os 

-.38191 

.46194 

.37010 

-.48376 

-.61559 

.36691 

Step  8. 

Obtain  the  ratv'^ore  beta  coefficients  for  the  g~1  X~variabies  upon  which  the 
diogonol  factors  are  based.  This  is  accomplished  by  talcing  the  mean  of  the  Principal 
Axis  Vector  to  be  zero  and  taking  its  variortce  to  be  unity.  Row  score  beta  for  the 
X^vor table  becomes 


1.00 


/nx^(N-nx^) 

F3 


/ 


N 


VnXj(N-nx^) 


(13) 


Obtain  the  constant  to  be  added  (A)  by  the  equation 


A 


=  .  X,) 

g-1 

*  "Xj 

N 


0.0 


(14) 


Tcdjie  VIII 


The  Raw  Score  Beta  Coefficients  (bj^^  ^)  for  the  g~1  X'Variobles 
and  the  Constont  to  be  Added  (A) 


A 

-.90867 

3.275 

.82315 

-1.5188 

-.16125 

2.30387 

.06111 
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step  9, 

The  values  in  Table  VIII  give  weights  to  use  in  a  prediction  equotion  of  the 

form 

P  =  -X^  +  ...+b^^_^-XgH  +A.  (15) 

If  we  consider  the  predicted  score  of  o  person  in  the  jth  x  cotegory,  the  only  X“ 
vorioble  to  have  a  nonzero  score  will  be  X^  ,  which  will  hove  a  value  of  1 ,0,  There" 
fore,  the  predicted  sc<»e  (I*®*  /  location  of  case  on  the  Principol  Axis  vector)  for  o 
person  who  was  in  the  ith  category  of  the  X~variabie  will  be 

=bjj^  (0)  +  b^(0)  +  ...  +bj^^  (1 )  +  ...  +bxg.^  (0)  +  A, 

which  simplifies  to 

Xi 

For  a  case  which  was  in  the  g*^^  category  of  X,  the  predicted  score  will  be  simply 
P„  =  A.  These  scores  represent  the  locations  of  the  various  X  categories  on  a  vector 
9 

through  the  X"Space  which  must  be  optimally  related  to  the  variances  of  the  Y“variables. 
The  meon  of  the  scores  will  be  zero  and  the  standard  deviation  will  be  one.  For  this 
reason  we  refer  to  these  predicted  scores  (Px'»)  «  the  z"welghts  (i.e, ,  z^^  =  Px  ). 

Table  IX  shows  the  computed  z-weights  based  on  Eq.  (16).  *  ^ 

Table  IX 


The  Computed  z'Welghts  for  the  g  X”Variables 


^2 

z 

^4 

*5 

^7 

-.84755 

3.78582 

.88426 

-1 .45766 

-.10014 

2.36498 

.06111 

Step  10. 

Inasmuch  os  the  X  categories  have  been  ordered  along  a  single  continuum,  we 
may  consider  X  to  be  an  interval-type  variable.  The  optimal  set  of  weights  for  the 
Y  categories  may  be  obtained  by  Wherry's  0)  multiserial  equation  which  states 

which/ for  our  purposes,  may  be  revwitten  as 
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r(»x.  •  Cl.) 


wh«re: 

by  =  the  mulHserial  weight  for  the  j*"  Y  category, 

th 

z„  =  the  x"welght  for  the  i  X  category, 

e 

c  =  the  numf^  of  cases  in  the  X  category  which  vi^e  also 
in  the  Y  category,  and 

n^  =  the  total  number  of  coses  in  the  Y  category. 

The  multiserioi  weights  for  the  Y  cctegorles,  os  computed  by  Eq.  0^# 
shown  in  Table  X. 

Table  X 

The  Multiserial  Weights  for  the  Y  Categories 


-.86933  i. 01338  -.39234  1.76070  -.65546  .03737  .  72035  ,26810 


Step  11, 


Obtain  the  K -coefficient  by  the  equation 


h 

Z  b»  .  nj 


for  the  present  problem  K_  has  a  value  of  .56219. 

Step  12. 

This  Final  step  is  optional  but  may  be  accomplished  if  z-weights  ore  desired  for 
the  Y  variables  instead  of  multiserial  weights.  The  z-weights,  when  opplied  to  the 
entire  sampie,wiil  yield  a  mean  of  zero  and  a  standard  deviation  of  unity.  The  z- 
vveights  for  the  h  Y*wariobies  are  obtained  by  applying  the  equation 

*v  “  /  l^xv  •  0^ 


The  calculated  z-weights  for  the  Y-variables  are  shown  in  Table  XI. 


Table  XI 


The  z”Weighh  for  the  h  Y“Varioblei 


*yi 

^2 

^3 

*y5 

z 

^6 

V7 

Vs 

-1.54632 

1.80256 

-.69788 

3.13186 

-1 .165992 

.06630 

1.28133 

.47688 

Proof  that 


Is  a  Peonon -Moment  Correlation  Coefficient 


Using  the  z-wetghts  for  the  g  X-calegories  anJ  the  h  Y~categorles  will  yield 
means  of  zero  and  standard  deviations  of  unity  for  both  X  and  Y.  Under  such  circum¬ 
stances  the  Peonon  product-moment  correlation  coefficient  may  be  computed  as 


N 

■“XY  “  * 

N 


The  above  equation  may  be  rewritten  os 

h  9 

rxY  '  ') 


N 


N 


(20) 


However,  from  Eq.  07)  we  know 


therefore,  substituting  this  value  into  Eq.  (20)  we  obtain 

^XY  - - - - - 


N 


(21) 


Now,  substituting  Eq.  0^)  (20)  we  obtain 


XY  — J_ 


N  .  K 


XY 


(22) 
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•  # 


From  Eq.  (18)^ 


thot 


F4 


Hiarofor*/  lubftihjfing  this  value  into  £q.  (22)  we  tee 


^XY 


K* 

=  XY 

•^XY 


t 


w^iich  proves  the  K^coefflcient  is,  in  fact,  a  f^eceson-type  measure  of  eorrelotton  fsr 
cotegoricol  data. 

The  data  used  In  the  obove  exomple  ware  obtained  by  randomly  combining 
various  adjacent  coiuenns  and  vorlous  odfocani  rows  of  some  "interval "  type  data  on 
"heigSis  of  fothers"  (X)  and  "height  of  sons"  (Y)  as  found  on  page  117  of  McNemar  (9), 
The  data  weie  combined  as  shown  in  Figure  1 . 

The  z"weights  derived  by  the  procediMe  deeio'ibed  in  this  paper,  when  edited  in 
subsequent  somples,  would  not  be  expected  to  yield  a  correlation  this  high.  A  poper  on 
the  expected  "dvinkage*  of  the  K^coefficient  is  being  ptepared  as  well  as  o  paper  on 
testing  the  significance  of  the  K^coefficient.  Until  the  latter  paper  is  published  users 
of  the  K*%oefficient  may  use  the  X  *  -test  associated  with  the  contingency  coefficient 
to  decide  if  obtained  K^ooefficients  differ  significantly  from  zsho. 

Initial  investigations  of  the  distribution  of  K~coefficients  indiccte  that  a 
K'ooefficient  may  be  less  than,  os  well  as  greater  than,  the  corresponding  C” 
coefficient. 

The  obtaining  of  a  K'xoefficient,  especially  when  the  number  of  rows  and 
columns  is  large,  is  obviously  a  tedious  process  and  one  which  should  be  handled  by 
computer  rather  than  desk  calculator.  The  technique  has  been  programmed  for  the  BM 
1620  computer  and  will  handle  problems  where  g  h  $  20.  In  spite  of  the  complexity 
of  the  technique,  the  K~coefficient  seems  for  more  ocoeptable  os  the  proper  measure  of 
relationship  between  oategorical  variables  than  the  older  contingency  coefficient. 
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Figure  1 .  The  Original  Scatteiplot  of  the  Data  from  which 
Table  I  wot  Obtained 
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